Artificial Intelligence-Enabled Management of Storage Media Access

ABSTRACT

The present disclosure describes apparatuses and methods for artificial intelligence-enabled management of storage media. In some aspects, a media access manager of a storage media system receives, from a host system, host input/output commands (I/Os) for access to storage media of the storage media system. The media access manager provides information describing the host I/Os to an artificial intelligence engine and receives, from the artificial intelligence engine, a prediction of host system behavior with respect to subsequent access of the storage media. The media access manager then schedules, based on the prediction of host system behavior, the host I/Os for access to the storage media of the storage system. By so doing, the host I/Os may be scheduled to optimize host system access of the storage media, such as to avoid conflict with internal I/Os of the storage system or preempt various thresholds based on upcoming idle time.

CROSS REFERENCE TO RELATED APPLICATION

This application is a continuation of and claims priority to U.S.Non-Provisional patent application Ser. No. 17/320,096 filed May 13,2021, which in turn is a continuation of and claims priority to U.S.Non-Provisional patent application Ser. No. 16/664,528, filed on Oct.25, 2019, which is now U.S. Pat. No. 11,010,314, granted on May 18,2021, which in turn claims priority to U.S. Provisional PatentApplication Ser. No. 62/752,876 filed Oct. 30, 2018, the disclosures ofwhich are incorporated by reference herein in their entireties.

BACKGROUND

Many computing and electronic devices include non-volatile memory forstoring software, applications, or data of the device. Additionally,most users stream data or access services with their devices, such asmultimedia content or social media applications, over data networks fromvarious locations or on the move. With users' ever-increasing demand fordata and services, storage providers have scaled up capacity andperformance of storage drives to support the data access associated withthese activities of users and other data storage clients. Typically, astorage drive of a device includes storage media to which data of thedevice is written and read from. To do so, the device may issue dataaccess requests to the storage drive, which in turn writes the data toor reads the data from the storage media as specified by each request.Thus, storage drive performance generally depends on a rate at which thestorage drive is able to complete the data access requests of the deviceor the storage client.

The storage media of the storage drive is not accessed solely based onthe data access requests received from the device. The storage driveitself may implement various internal operations related to health ormaintenance of the storage media. In conventional storage drives, accessto the storage media associated with these internal drive operations isnot planned and may conflict with access of the storage media forservicing the data requests of the device. Accordingly, when theinternal operations of the storage drive result in conflicting accessthat interferes with data write operations or data read operations ofthe device, overall storage drive performance may degrade as datarequest latency increases and data throughput of the storage drivedecreases.

SUMMARY

This summary is provided to introduce subject matter that is furtherdescribed in the Detailed Description and Drawings. Accordingly, thisSummary should not be considered to describe essential features nor usedto limit the scope of the claimed subject matter.

In some aspects, a media access manager of a storage media systemimplements a method that receives, from a host system and via a hostinterface of a storage system, host input/outputs (I/Os) for access tostorage media of the storage system. The media access manager providesinformation describing the host I/Os received from the host system to anartificial intelligence engine associated with the storage system. Themedia access manager receives a prediction of host system behavior withrespect to subsequent access of the storage media by the host systemfrom the artificial intelligence engine. Based on the prediction of hostsystem behavior, the media access manager schedules the host I/Os foraccess to the storage media of the storage system.

In other aspects, an apparatus comprises a host interface configured forcommunication with a host system, storage media to store data of thehost system, and a media interface configured to enable access to thestorage media. The apparatus also includes an artificial intelligenceengine and a media access manager that is configured to receive, via thehost interface, host input/outputs (I/Os) from the host system foraccess to the storage media of the apparatus. The media access managerprovides information describing the host I/Os received from the hostsystem to the artificial intelligence engine. The media access managerthen receives a prediction of host system behavior with respect tosubsequent access of the storage media by the host system from theartificial intelligence engine. Based on at least the prediction of hostsystem behavior, the media access manager schedules the host I/Os foraccess to the storage media of the apparatus.

In yet other aspects, a System-on-Chip (SoC) is described that includesa media interface to access storage media of a storage system, a hostinterface to communicate with a host system, and an artificialintelligence engine. The SoC also includes a hardware-based processorand a memory storing processor-executable instructions that, responsiveto execution by the hardware-based processor, implement a media accessmanager to receive host input/outputs (I/Os) from the host system foraccess to the storage media of the storage system via the hostinterface. The media access manager provides information describing thehost I/Os received from the host system to the artificial intelligenceengine. The media access manager then receives a prediction of hostsystem behavior with respect to subsequent access of the storage mediaby the host system from the artificial intelligence engine. Based on atleast the prediction of host system behavior, the media access managerschedules the host I/Os for access to the storage media of the storagesystem.

The details of one or more implementations are set forth in theaccompanying drawings and the following description. Other features andadvantages will be apparent from the description and drawings, and fromthe claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The details of one or more implementations of an artificialintelligence-enabled (AI-enabled) management of storage media access areset forth in the accompanying figures and the detailed descriptionbelow. In the figures, the left-most digit of a reference numberidentifies the figure in which the reference number first appears. Theuse of the same reference numbers in different instances in thedescription and the figures indicates like elements:

FIG. 1 illustrates an example operating environment having devices inwhich an AI-enabled storage media controller is implemented inaccordance with one or more aspects of the disclosure.

FIG. 2 illustrates example configurations of a media access manager andan AI engine of the storage controller shown in FIG. 1 .

FIG. 3 illustrates example configurations of various hardware andfirmware components for implementing an AI engine of a storage mediacontroller.

FIG. 4 illustrates an example configuration of an AI engine andpersistent memory for implementing multiple AI models.

FIG. 5 illustrates example configurations of a cache memory and astorage memory through which aspects of AI-enabled management of storagemedia access may be implemented.

FIG. 6 illustrates an example of predictive garbage collectionimplemented with an AI engine of a storage system controller.

FIG. 7 illustrates an example of internal I/O operations scheduled by anAI engine in accordance with one or more aspects of the disclosure.

FIG. 8 illustrates example host I/O event types that are useful to an AImodel predicting host system behavior in accordance with one or moreaspects of the disclosure.

FIG. 9 illustrates an example implementation of an AI model configuredto predict various aspects of host I/O behavior.

FIG. 10 illustrates examples of predicted host behavior includingvarious I/Os or idle time.

FIG. 11 illustrates an example implementation of an AI model thatsupports multi-stage prediction of host behavior.

FIG. 12 illustrates an example beam search useful to determine a path ofpredicted host behavior based on event probability.

FIG. 13 illustrates an example timeline of operations for onlinere-training or online refinement of an AI model.

FIG. 14 depicts an example method for AI-enabled management of storagemedia access in accordance with one or more aspects of the disclosure.

FIG. 15 depicts an example method for delaying an internal operation ofa storage system based on a prediction of host system behavior.

FIG. 16 depicts an example method for advancing an internal operation ofa storage system based on a prediction of host system behavior.

FIG. 17 depicts an example method for altering threshold fordevice-level management based on a prediction of host system behavior.

FIG. 18 depicts an example method for running inferences with multipleinstances of an AI model to enable online re-training or refinement.

FIG. 19 illustrates an example System-on-Chip (SoC) environment in whichaspects of AI-enabled management of storage media access may beimplemented.

FIG. 20 illustrates an example storage system controller in which an AIengine is implemented in accordance with one or more aspects of thedisclosure.

DETAILED DESCRIPTION

Conventional techniques for managing access to storage media oftenresult in conflicted or inefficient storage media access that degradesstorage drive performance. Generally, storage drive firmware is used tomanage a data path of the storage drive in an end-to-end manner, such asby translating data commands received from a device requesting access todata on the storage media. During operation, if storage media health orstorage media maintenance issues arise, the firmware of the storagedrive typically schedules storage media access to facilitate theseinternal tasks against the storage media access associated with the datacommands of the device. The data commands of the device, which utilizeresources of the storage media, may also further affect any ongoinghealth or maintenance issues of the storage media. In other words, theconventional techniques often schedule access to the storage media basedon a current state of storage media health or storage media maintenance,which is a short-term or narrow view of storage drive needs andperformance. As such, the conventional techniques for managing accessoften result in conflicted or inefficient access of the storage media,which degrades or reduces performance of the storage drive.

This disclosure describes apparatuses and techniques for AI-enabledmanagement of storage media access. In contrast with conventionaltechniques of storage media access, the described apparatuses andtechniques may implement AI-enabled-management of storage media accessfor efficient and coordinated scheduling of host input/outputs (I/Os) orinternal I/Os for optimized storage drive performance. In some aspects,firmware of a storage controller (a media access manager) may useartificial intelligence (AI) models, running on native or dedicated AIhardware and/or firmware, for predictions or forecasts of activity thatare useful by the firmware to manage storage media access and improvestorage drive performance. For example, an AI engine using the AI modelsmy predict relevant external events (e.g., host system activity) forinternal firmware/software of the storage controller. The AI-enabledstorage controller may determine when to perform internal storagecontroller-related tasks based on the predicted external events, ratherthan only a current device state or history of past events.

Generally, aspects of AI-enable management of storage media access mayimplement intelligent predictive and adaptive scheduling of host I/Osand/or internal I/Os through predicted host system behavior for astorage system (e.g., storage drive). In some cases, this may enableinternal I/O scheduling that is optimized through AI-assisted predictionof events. In other cases, a media access manager may preempt, suspend,or disregard a threshold (e.g., garbage collection or thermal limit) toallow performance of host I/Os based on an upcoming idle time orprojected decrease in host system activity. The media access manager andAI engine may also optimize various Flash translation layer (FTL)management operations (e.g. caching, migration, or garbage collection),as well as device-level tasks that include thermal management or powermanagement for storage media devices.

In various aspects, the AI engine and AI models of the storagecontroller may predict specific idleness-related events (e.g., time tonext idle, next idleness duration) of the host system, as well as hostsystem I/O behavior (e.g., write density) or other parameters of storagemedia access. Generally, the AI engine of the storage controller mayimplement or manage multiple AI models, which may be loaded to AIhardware or AI firmware based on various internal tasks of a storagesystem that would benefit from AI assistance. In some cases, thesemultiple or different AI models run or execute concurrently on native orAI hardware to provide simultaneous AI assistance for multiple internaltasks. Alternately or additionally, the AI engine may perform online(run-time) re-training or refinement of an AI Model enable dynamicadaptation to user- or host system-specific I/O workloads.

In various aspects of AI-enabled management of storage media access, amedia access manager of a storage media system receives host I/Os foraccess to storage media of the storage media system from a host system.The media access manager provides information describing the host I/Osreceived from the host system to an artificial intelligence engine. Themedia access manager then receives a prediction of host system behaviorwith respect to subsequent access of the storage media from theartificial intelligence engine. Based on the prediction of host systembehavior, the media access manager then schedules the host I/Os foraccess to the storage media of the storage system. By so doing, the hostI/Os may be scheduled to optimize host system access of the storagemedia, such as to avoid conflict with internal I/Os of the storagesystem or preempt various thresholds or parameters based on upcomingidle time.

The following discussion describes an operating environment, techniquesthat may be employed in the operating environment, and a System-on-Chip(SoC) in which components of the operating environment may be embodied.In the context of the present disclosure, reference is made to theoperating environment by way of example only.

Operating Environment

FIG. 1 illustrates an example operating environment 100 having a hostsystem 102, capable of storing or accessing various forms of data orinformation. Examples of a host system 102 may include a laptop computer104, desktop computer 106, and server 108, any of which may beconfigured as user device, computing device, or as part of a storagenetwork or cloud storage. Further examples of host system 102 (notshown) may include a tablet computer, a set-top-box, a data storageappliance, wearable smart-device, television, content-streaming device,high-definition multimedia interface (HDMI) media stick, smartappliance, home automation controller, smart thermostat,Internet-of-Things (IoT) device, mobile-internet device (MID), anetwork-attached-storage (NAS) drive, aggregate storage system, gamingconsole, automotive entertainment device, automotive computing system,automotive control module (e.g., engine or power train control module),and so on. Generally, the host system 102 may communicate or store datafor any suitable purpose, such as to enable functionalities of aparticular type of device, provide a user interface, enable networkaccess, implement gaming applications, playback media, providenavigation, edit content, provide data storage, or the like.

The host system 102 includes a processor 110 and computer-readable media112. The processor 110 may be implemented as any suitable type or numberof processors, either single-core or multi-core, for executinginstructions or commands of an operating system or other applications ofthe host system 102. The computer-readable media 112 (CRM 112) includesmemory (not shown) and a storage system 114 of the host system 102. Thememory of the host system 102 may include any suitable type orcombination of volatile memory or nonvolatile memory. For example, thevolatile memory of host system 102 may include various types ofrandom-access memory (RAM), dynamic RAM (DRAM), static RAM (SRAM) or thelike. The non-volatile memory may include read-only memory (ROM),electronically erasable programmable ROM (EEPROM) or Flash memory (e.g.,NAND Flash). These memories, individually or in combination, may storedata associated with a user, applications, and/or an operating system ofhost system 102.

The storage system 114 of the host system 102 may be configured as anysuitable type of data storage system, such as a storage device, storagedrive, storage array, storage volume, or the like. Although describedwith reference to the host system 102, the storage system 114 may alsobe implemented separately as a standalone device or as part of a largerstorage collective, such as a network-attached storage device, externalstorage drive, data center, server farm, or virtualized storage system(e.g., for cloud-based storage or services). Examples of the storagesystem 114 include a non-volatile memory express (NVMe) solid-statedrive 116, a peripheral component interconnect express (PCIe)solid-state drive 118, a solid-state drive 120 (SSD 120), and a storagearray 122, which may be implemented with any combination of storagedevices or storage drives.

The storage system 114 includes storage media 124 and a storage mediacontroller 126 (storage controller 126) for managing various operationsor functionalities of the storage system 114. The storage media 124 mayinclude or be formed from non-volatile memory devices on which data 128or information of the host system 102 is stored. The storage media 124may be implemented with any type or combination of solid-state memorymedia, such as Flash, NAND Flash, RAM, DRAM (e.g., for caching), SRAM,or the like. In some cases, the data 128 stored to the storage media 124is organized into files of data (e.g., content) or data objects that arestored to the storage system 114 and accessed by the host system 102.The file types, sizes, or formats of the data 128 may vary depending ona respective source, use, or application associated with the file. Forexample, the data 128 stored to the storage system 114 may include audiofiles, video files, text files, image files, multimedia files,spreadsheets, and so on. Although described with reference tosolid-state memory, aspects of AI-enabled management of storage mediaaccess may also be implanted with magnetic-based or optical-based mediatypes.

Generally, the storage controller 126 manages operation of the storagesystem 114 and enables the host system 102 to access the storage media124 for data storage. The storage controller 126 may be implementedthrough any suitable combination of hardware, firmware, or software toprovide various functionalities of the storage system 114. The storagecontroller 126 may also manage or administrate internal tasks oroperations associated with the storage media 124, such as data caching,data migration, garbage collection, thermal management (e.g.,throttling), power management, or the like. As such, the storagecontroller 126 may receive host I/Os from the host system 102 for dataaccess and queue (or generate) internal I/Os associated with internaloperations for the storage media 124. Generally, the storage controller126 may perform media I/Os for access of the storage media 124 thatcorrespond to scheduled host I/Os for data access and/or internal I/Osfor internal operations or tasks associated with the storage media 124.

In this example, the storage controller 126 also includes a storagemedia access manager 130 (media access manager 130), an artificialintelligence engine 132 (AI engine 132) and artificial intelligencemodels 134 (AI models 134). In other configurations, the storagecontroller 126 may have access to an AI engine 132 or AI models 134 thatare implemented separately from the storage controller 126. In variousaspects, the media access manager 130 uses the AI engine 132 and AImodels 134 to obtain predictions or forecasts of activity (e.g., hostactivity or inactivity) that are useful to manage storage media access,perform internal tasks, and improve storage drive performance.Generally, the media access manager 130 may implement intelligentpredictive and adaptive scheduling (e.g., advance or delay) of host I/Osand/or internal I/Os through predicted host system behavior for astorage system (e.g., storage drive). In some cases, this may enableinternal I/O scheduling that is optimized through AI-assisted predictionof events.

For example, the media access manager 130 may provide indications ofhost I/Os for storage media access to the AI engine 132. The AI engine132 may use the indications of host I/O activity as inputs for the AImodels 134 to predict or forecast subsequent host system behavior. Basedon this prediction of host system behavior, the media access manager mayschedule the host I/O and/or internal I/Os of the storage system 114, aswell as adapt to user-specific workloads to improve performance of thestorage system. The AI models 134 may include any suitable type ofmodels, such as AI models based on recurrent neural network (RNN)architecture. AI models with an RNN type architecture may be configuredwith a memory for processing a history of inputs, making these modelswell suited for predicting host system behavior or future activity. AnAI model 134 may predict host behavior through any suitable parameter,such as idleness-related parameters (e.g., time to next idle or nextidle duration) or write density parameters (e.g., how much data is thehost system expected to write for a given duration). How the mediaaccess manager 130, AI engine 132, and AI models 134 are implemented andused varies and is described throughout the disclosure.

The host system 102 may also include I/O ports 136, a graphicsprocessing unit 138 (GPU 138), and data interfaces 140. Generally, theI/O ports 136 allow a host system 102 to interact with other devices,peripherals, or users. For example, the I/O ports 136 may include or becoupled with a universal serial bus, human interface devices, audioinputs, audio outputs, or the like. The GPU 138 processes and rendersgraphics-related data for host system 102, such as user interfaceelements of an operating system, applications, or the like. In somecases, the GPU 138 accesses a portion of local memory to render graphicsor includes dedicated memory for rendering graphics (e.g., video RAM) ofthe host system 102.

The data interfaces 140 of the host system 102 provide connectivity toone or more networks and other devices connected to those networks. Thedata interfaces 140 may include wired interfaces, such as Ethernet orfiber optic interfaces for communicated over a local network, intranet,or the Internet. Alternately or additionally, the data interfaces 140may include wireless interfaces that facilitate communication overwireless networks, such as wireless LANs, wide-area wireless networks(e.g., cellular networks), and/or wireless personal-area-networks(WPANs). Any of the data communicated through the I/O ports 136 or thedata interfaces 140 may be written to or read from the storage system114 of the host system 102 in accordance with one or more aspects ofAI-enabled management of storage media access.

FIG. 2 illustrates example configurations of a media access manager 130and AI engine 132 generally at 200, which are implemented in accordancewith one or more aspects of AI-enabled management of storage mediaaccess. In this example, the media access manager 130 and AI engine 132are illustrated in the context of a storage system 114 that isimplemented as a solid-state storage drive (SSD) 202. The SSD 202 may becoupled to any suitable host system 102 and implanted with storage media124 that includes multiple NAND Flash devices 204-1 through 204-n, wheren is any suitable integer. In some cases, the NAND Flash devices 204include multiple Flash channels of memory devices, dies, or chips thatmay be accessible or managed on a channel-level (group of devices) ordevice-level (individual devices). Although illustrated as components ofthe SSD 202, the media access manager 130 and/or AI engine 132 may beimplemented separately from or external to a storage system 114. In somecases, the media access manager 130 or AI engine 132 are implemented aspart of a storage media accelerator or aggregate storage controllercoupled between a host system 102 and one or more storage systems 114.

Generally, operations of the SSD 202 are enabled or managed by aninstance of the storage controller 126, which in this example includes ahost interface 206 to enable communication with the host system 102 anda media interface 208 to enable access to the storage media 124. Thehost interface 206 may be configured to implement any suitable type ofstorage interface or protocol, such as serial advanced technologyattachment (SATA), universal serial bus (USB), PCIe, advanced hostcontroller interface (AHCI), NVMe, NVM-over Fabric (NVM-OF), NVM hostcontroller interface specification (NVMHCIS), small computer systeminterface (SCSI), serial attached SCSI (SAS), secure digital I/O (SDIO),Fibre channel, any combination thereof (e.g., an M.2 or next generationform-factor (NGFF) combined interface), or the like. Alternately oradditionally, the media interface 208 may implement any suitable type ofstorage media interface, such as a Flash interface, Flash bus channelinterface, NAND channel interface, physical page addressing (PPA)interface, or the like.

In various aspects, components of the SSD 202 or storage controller 126provide a data path between the host interface 206 to the host system102 and the media interface 208 to the storage media 124. In thisexample, the storage controller 126 includes processor cores 210 forexecuting a kernel, firmware, or a driver to implement functions of thestorage controller 126. In some cases, the processor cores 210 may alsoexecute processor-executable instructions to implement the media accessmanager 130 or the AI engine 132 of the storage controller 126.Alternately or additionally, the media access manager 130 or the AIengine 132 may execute from or run on AI-specific hardware or processorcores.

As shown in FIG. 2 , a fabric 212 of the storage controller 126, whichmay include control and data buses, operably couples and enablescommunication between the components of the storage controller 126. Forexample, the media access manager 130 or AI engine 132 may communicatewith the host interface 206, processor cores 210 (e.g., firmware), ormedia interface 208 to exchange data, information, or I/Os within thestorage controller 126. A static random-access memory 214 of the storagecontroller 126 may store processor-executable instructions or code forfirmware or drivers of the storage controller, which may be executed bythe processor cores 210. The storage controller 126 may also a dynamicrandom-access memory (DRAM) controller 216 and associated DRAM 218 forstorage or caching various data as the storage controller 126 moves databetween the host system 102, storage media 124, or other components ofthe storage controller.

FIG. 3 illustrates at 300 example configurations of various hardware andfirmware components for implementing an AI engine of a storage mediacontroller. In this example, the components of the storage controller126 are shown as abstracted entities that may be implemented in firmware302 or hardware 304 of the storage controller. This is but one exampleimplementation of various components, any of which may be implementedseparately from or in combination with other components describedherein. Alternately or additionally, any of the components describedwith reference to FIG. 2 or FIG. 3 may be implemented as an intellectualproperty block (IP block) or IP core configured as a unit of logic,cell, and/or integrated-circuit (IC) that provides various describedfeatures of a component. For example, a component of the storagecontroller 126 (e.g., AI engine 132) may be implanted as an IP core orthe IP block that includes a combination of hardware, firmware, orsoftware to provide respective functionalities or implement respectiveoperations of the component.

In this example, the hardware 304 of the storage controller 126 includesNAND flash devices 204, a host interface 206, a media interface 208, andprocessors 210, which may be implemented as described with reference toFIG. 2 . In some aspects, the AI engine 132 is implemented as part of anAI block 306 (e.g., AI IP block) that includes a processor core 308 onwhich the AI engine 132 executes or runs. The AI engine 132 may also runone or more of the AI models 134 and provide an AI engine driver 310 bywhich the AI engine 132 interacts with the firmware 302 of the storagecontroller 126. Alternately or additionally, the AI engine 132 (e.g.,light-weight AI engine 132 and models 134) may execute on the processorcores 210 of the storage controller 126 to provide the AI engine driver310.

Generally, the firmware 302 of the storage controller 126 assists thehardware 304 to manage the data path between the host system 102 andstorage media 124. In other words, the firmware 302 may translatecommands or requests for data received from the host system 102 toenable access of the storage media 124. As shown in FIG. 3 , thefirmware 302 includes a host interface driver 312 to implement a hostcommand handler 314 and a media interface driver 316 to implement mediacommand manager 318. As shown in FIG. 3 , host input/output commands 320(host I/Os 320) are received by the host command handler 314 and sent toan I/O scheduler 322 of a Flash translation layer 324 (FTL 324) of thestorage controller 126. The FTL 324 and/or I/O scheduler 322 may processand schedule the host I/Os 320, which may then be performed ascorresponding media input/output commands 326 (media I/Os 326) forstorage media access through the media command manager 318.

In various aspects, the FTL 324 manages command processing (e.g., hostI/O 320 translation and scheduling) to facilitate movement of hostsystem data within a storage system 114 and/or through a storagecontroller 126, such as to storage media 124. The FTL 324 may alsomonitor resources or health of the storage media 124 or storage mediadevices. For example, the FTL 324 may monitor the storage media 124 orcache DRAM 218 for an amount of free space, capacity, free blocks, badblocks, write/programming cycle count, device/block wear pattern, powerconsumption, temperature, or the like. In some cases, the FTL 324includes internal FTL tasks 328 (internal tasks 328) for management ofstorage media health and resources. These internal tasks 328 of the FTL324 or storage controller 126 may include tasks or operations thataccess the storage media, such as data migration, garbage collection,wear leveling, or the like. To implement the internal tasks 328, the FTL324 may generate internal I/Os for storage media access, which may thenbe performed as corresponding media I/Os 326 for storage media accessthrough the media command manager 318.

In this example, the firmware 302 of the storage controller 126 alsoincludes a device-level management 330 component to manage device-levelaspects of the storage media 124. For example, the device-levelmanagement 330 component may monitor or manage parameters of individualones of the NAND flash devices 204, NAND channels, memory chips, memorydies, physical memory blocks, or the like. In some cases, thedevice-level management 330 component monitors temperature conditions ofthe storage media 124 (e.g., die or device temperature) and implementsthermal control (e.g., throttling) based on a predefined or adaptivetemperature threshold. For example, the device-level management 330component may delay or restrict access to a specific NAND device or NANDchannel in response to a temperature of the NAND device or NAND channelexceeding the temperature threshold. Alternately or additionally, thedevice-level management 330 component may implement similar forms ofpower management (e.g., consumption-based) or other device-levelmonitoring and control of the storage media 124.

In aspects of AI-enabled management of storage media access, the mediaaccess manager 130 may interact with the AI engine 132 to optimize theprocessing and scheduling of various I/Os for access to the storagemedia 124. In some aspects, the media access manager 130 provides orforwards host I/Os 320 to an AI model 134 of the AI engine 132 as inputs332. The inputs 332 may include any suitable type of data or informationdescribing the host I/Os 320 (host system activity), such as an eventtype of the host I/O, an event duration of the host I/O, an event sizeof data associated with the host I/O, or the like. Based on the inputs332, the AI model 134 generates or derives as outputs 334 a predictionof host system behavior. The outputs 334 or prediction of host mayinclude any suitable type of data or information, such as an indicationof a duration of time until the host system becomes idle, a duration oftime for which the host system will remain idle, parameters regarding anext host I/O issued by the host system, or the like. The outputs 334may be received by the media access manager 130 directly or throughanother entity, such as the I/O scheduler 322 or the FTL 324 of thestorage controller 126. Using the prediction of host behavior, such aspredicted host events (e.g., write density or idle times) or predictedhost behavior information, the media access manager 130 may alter ormodify how the host I/Os 320 and internal I/Os are processed, scheduled,or performed as media I/Os for storage media access.

For example, in the context of internal tasks 328, the media accessmanager 130 may provide information describing host I/Os 320 to the AImodel 134 and receive from the AI model 134 a prediction of host systembehavior with respect to subsequent access of the storage media 124. Themedia access manager 130 and I/O scheduler 322 may then use indicationsof storage media health and resources, as well as the prediction of hostsystem behavior to process and schedule the host I/Os 320 (e.g., currentand subsequently received host I/Os) and internal I/Os for the internaltasks 328 to optimize access of the storage media 124. By so doing, I/Oscheduling may be improved by avoiding or reducing conflict between hostI/Os and internal I/Os that compete for access to a same storage mediaresource.

As another example, consider AI-enabled management of storage mediaaccess in the context of device-level management operations, such asthermal-based throttling of storage media access. In addition to usingcurrent I/O workload or memory device states to determine a degree oramount of thermal throttling, the media access manager 130 may also usea prediction of host system behavior provided by an AI model 134 of theAI engine 132. In some cases, the media access manager 130 may alter orpreempt (e.g., temporarily suspend) a thermal threshold of a memorydevice based on a predicted upcoming idle time, during which temperatureof the memory device will cool during inactivity. Alternately oradditionally, the media access manager 130 may reduce a degree to whichstorage media access is throttled based on the prediction of host systembehavior. As such, having the ability to use this prediction of hostsystem behavior may result in less throttling of storage media access,which in turn would improve host I/O performance. These are but a fewexamples of AI-enabled management of storage media access, other ofwhich are enable and described throughout this disclosure.

FIG. 4 illustrates at 400 an example configuration of an AI engine andpersistent memory for implementing multiple AI models. In some aspects,an AI engine 132 may include or have access to multiple AI models 134that are configured or trained to assist with or optimize differentrespective operations or tasks of a storage media controller. The AImodels 134 may be stored in a persistent storage media of the storagesystem, such as the storage media 124 or an internal memory of thestorage controller 126 (not shown). In this example, the AI models 134are stored on the NAND flash devices 204, such as those implemented bythe SSD 202 of FIG. 2 . Here, the AI models 134 include multiple AImodels, which are shown as AI model A 134-1 through AI model N 134-n,where n is any suitable integer. Each of the AI models 134 may beconfigured or trained for a respective internal task 328 or device-leveloperation of the storage controller, such as data migration, garbagecollection, wear leveling, thermal control, or the like. Alternately oradditionally, an AI model 134 may be configured or trained to assistwith multiple internal tasks 328 of the FTL 324 storage controller 126.

As shown at 402, the media access manager 130 or AI engine 132 may loadmultiple AI models 134 to assist with and/or optimize internal tasks 328of a storage system 114. In some cases, the multiple AI models 134 areloaded in response to the storage system 114 booting or powering up. Inother cases, one or more of the AI models 134 may be loaded on-demand oras requested by the AI engine 132. Generally, an AI model 134 may beloaded to assist with or enable optimization of a corresponding internaltask 328 of the storage system 114 or storage controller 126. In thisexample, assume the media access manager 130 or FTL 324 is running afirst internal task A 328-1 (e.g., data migration) and a second internaltask B 328-2 (e.g., garbage collection). Here, assume that the AI engine132 loads both AI model A 134-1 and AI model B 134-2 via the mediainterface 208 when the storage system 114 boots up.

In the context of the present example, the AI model A 134-1 and AI modelB 134-2 may be run sequentially or concurrently to assist with internaltask A and/or internal task B of the storage controller. As shown inFIG. 4 , the AI engine driver 310 may provide a first set of inputs332-1 to the AI model A 134-1 and a second set of inputs 332-2 to the AImodel B 134-2. The first set of inputs 332-1 may be the same, similarto, or different from the second set of inputs 332-2, such as configureddifferently for each internal task 328. To assist with the internaltasks, the AI model A provides a first set of outputs 334-1 to the I/Oscheduler 322 or media access manager 130 and the AI model B 134-2provides a second set of outputs 334-2 to the I/O scheduler 322 or mediaaccess manager 130. The first set of outputs 334-1 may be the same,similar to, or different from the second set of outputs 334-2, such asconfigured differently for each internal task 328.

FIG. 5 illustrates at 500 example configurations of a cache memory and astorage memory through which aspects of AI-enabled management of storagemedia access may be implemented. In this example, a cache memory isillustrated as a single level cell (SLC) Flash cache 502, though otherFlash or memory types may be used to cache data before writing tomulti-level cell (MLC) Flash storage 504. The MLC storage 504 may alsobe implemented with other Flash or memory types, such as triple-levelcell (TLC) Flash, quad-level cell Flash (QLC), XLC Flash, NOR Flash, orthe like. Generally, a storage system 114 may be implemented with an SLCcache 502 for improved burst write performance or to reduce wear on theMLC storage media 504. The SLC Flash or other types of cache memory,however, may be more expensive than the storage media such that the SLCcache 502 is implemented with less capacity than a main storage area ofMLC Flash memory.

As shown in FIG. 5 , host I/Os 320 of the host system 102 may cause datato be written to free space 506 of the SLC cache 502 (host I/O to mediaI/O conversion is omitted here for visual brevity). The data of the hostI/Os 320 may fill or occupy non-contiguous areas (e.g., partial pages orblocks) the SLC cache 502 as valid data 508 or consolidated data 510. Insome cases, the valid data 508 is organized and moved within the SLCcache 502 to form the consolidated data 510. As the SLC cache 502 fillswith data, the storage controller 126 migrates the valid data 508 orconsolidated data 510 to the MLC storage 504 as shown at 512.Alternately or additionally, the host I/Os 320 may also cause data writedirectly to the MLC storage 504, bypassing the SLC cache 502.

As the MLC storage 504 fills with data, the storage controller 126 mayimplement garbage collection of partially valid MLC blocks to free theseMLC blocks for reuse (e.g., enabling data writes as free MLC blocks) asshown at 514. As such, host I/Os 320 for data writes to the storagemedia may result in internal I/Os corresponding to data migration 512and garbage collection 514 as performed by the storage controller 126 orFTL 324. In some cases, the internal I/Os for data migration 512 andgarbage collection 514 arise concurrently with the host I/Os 320received from the host system 102. When the internal I/Os are allowed tocompete for access to the storage media (e.g., as corresponding mediaI/Os), host I/O performance, and thus overall host system performance,may be greatly impaired or reduced.

In aspects of AI-enabled management of storage media access the mediaaccess manager 130 may use the AI engine 132 in scheduling the host I/Os320 and internal I/Os to optimize host I/O performance and reduce oreliminate contention between the host I/Os and internal I/Os for accessto the storage media 124. Generally, the media access manager 130 mayimplement adaptive or optimized scheduling of host I/Os and internalI/Os utilizing predictions of host-related events and behavior. In otherwords, if future host I/O events or activities are predicted, based onpast host system activity, scheduling of internal FTL operations may beoptimized for predicted opportunities to avoid or reduce as manyconcurrent host I/Os and internal I/Os as possible. In various aspects,the media access manager 130 and/or AI engine 132 of a storagecontroller may implement predictive or adaptive internal tasks ordevice-level management for a storage system.

In some aspects, the media access manager 130 provides information tothe AI engine 132 regarding current or past activity of a host system,such as information or descriptive data for read or write commands(e.g., logical block address (LBA), size, timestamp) or idle time (e.g.,intervals, duty cycle, periodicity, frequency, duration, or the like).Based on the current or past host activity, the AI engine 132, using theAI models 134, may provide predictions of subsequent or future hostactivity, such as storage media access or idle time. A prediction ofstorage media access may include a type of access, an amount of data thehost system will write or read, how long a write or read will take tocomplete, or which data is likely to be invalidated or overwritten inthe future. Based on a prediction of future host system behavior, themedia access manager 130 may alter scheduling of host I/Os or internalI/Os by advancing or delaying various I/Os to avoid or reduce contentionfor access to the storage media. By so doing, host I/O performance maybe improved by efficiently masking or hiding execution of internal I/Osand taking advantage of host system idle time as perceived from the hostsystem (e.g., desktop or laptop computing devices).

FIG. 6 illustrates an example of predictive garbage collectionimplemented by the media access manager 130 and with an AI engine 132 ofa storage system controller. As shown at 600, host performance isdegraded by throttling or stalling that occurs often with conventionalgarbage collection. Without AI-enabled management, garbage collection istypically run whenever a write burst 602 of the host exceeds a level offree space 604 of storage media. The incoming data associated with thewrite burst 602 causes the level of free space to fall below a garbagecollection threshold 606, triggering the garbage collection that resultsin reduced host I/O performance 608 as internal I/Os for garbagecollection compete with host I/Os of the write burst 602 for storagemedia access.

In contrast with conventional techniques, an example of predictivegarbage collection enabled by the AI engine 132 and models 134 is shownat 610. In some aspects, a volume of storage media is configured withrespective thresholds and free space to support predictive garbagecollection 612 that may enable adaptive garbage collection operationsthat mitigate or avoid a reduction in host I/O performance (e.g.,preventing host I/O throttling). Using the AI engine 132, the mediaaccess manager 130 may receive information relating to a predicted writeburst 614 of a host system. The media access manager 130 may determinethat the predicted write burst 614 exceeds a level of free space 616 andimplement predictive garbage collection 612 internal I/Os before thepredicted write burst 614 is expected to occur. By so doing, thepredictive garbage collection 612 provides a new level of free space 618that may receive (or absorb) data of the predicted write burst 614 orother write bursts without triggering regular garbage collectioninternal I/Os that would interfere with host I/Os of the write burst.

FIG. 7 illustrates an example of internal I/O operations scheduled by anAI engine in accordance with one or more aspects. In this example, amedia access manager 130 may delay garbage collection and/or datamigration to improve host I/O performance. As shown at 700, withoutAI-enabled management, host writes 702 may trigger data migration at 704and then garbage collection at 706 as respective thresholds for each areencountered (e.g., as available free space decreases). When datamigration and garbage collection are performed to free up blocks ofstorage media, the internal I/Os contend with host I/Os for access tothe storage media interface and degrade host access as shown at 708.

In contrast with conventional techniques, an example of delayed garbagecollection and/or data migration enabled by the AI engine 132 and IAmodels 134 is shown at 710. In some aspects, the media access manager130 monitors host I/Os of a host system, which here include host I/Osfor host writes 712 to storage media. Based on the host writes 712, themedia access manager 130 may provide indications 714 of event types ordurations to an AI model 134 of the AI engine driver 310. Based on theindications 714 of host activity, the AI model 134 provides a predictionof host system behavior at 716. The prediction of host system behaviormay include information describing a next time to idle, next idleduration, a write density of host writes until the next idle of the hostsystem, or the like. Based on a prediction of upcoming idle time 718,the media access manager 130 may delay internal tasks of the storagecontroller as shown at 720. Without competing internal I/Os of the FTL324, full host performance is enabled at 722 relative where internaltasks of the storage controller would have degraded host I/O performancewith a conventional storage controller. Using the prediction of idletime of the host system, the media access manager 130 may then implementdelayed data migration 724 and delayed garbage collection 726 withoutconflicting with host I/Os of the host system.

Alternately or additionally, the media access manager 130 may implementadaptive cache management and dynamic bypass in accordance with one ormore aspects. Similar to adaptive garbage collection and data migration,the media access manager 130 may use a prediction of host systembehavior with respect to “write density” to manage use of a cache (e.g.,SLC Flash cache) through preemptive (e.g., early) data migration tostorage media (e.g., xLC Flash storage) or dynamic bypass of the cache.For example, based on a prediction of a large write burst, the mediaaccess manager 130 may perform early data migration from the cachememory to storage media to free enough space to absorb the write burst.In other cases, the media access manager 130 may determine that currentidle time is insufficient for data migration and instead cause the largewrite burst to bypass the cache memory for writing directly to thestorage media. This bypass data writing may be slower than using a cachememory of sufficient size, but avoids data migration from the cachememory, which if unable to absorb the entire write burst, wouldsubstantially degrade host I/O performance when concurrent datamigration triggers (e.g., much slower than the bypass data writing).

Various aspects described throughout the disclosure may be implementedby a media access manager 130 or FTL 324 that interacts with an AIengine, AI models, or AI driver of or associated with a storage system.With respect to processing various information of a storage system, theAI engine 132 and/or AI models 134 may be implemented withmachine-learning that is based on one or more neural networks for hostsystem activity or behavior prediction. Each AI model, AI algorithm, orneural network of the AI engine 132 may include a group of connectednodes, such as neurons or perceptrons, which are organized into one ormore layers.

By way of example, an AI model 134 (e.g., machine-learning model) of theAI engine 132 may be implemented with a deep neural network thatincludes an input layer, an output layer, and one or more hiddenintermediate layers positioned between the input layer and the outputlayers of the neural network. Each node of the deep neural network mayin turn be fully connected or partially connected between the layers ofthe neural network. An AI model or AI algorithm may be any deep neuralnetwork (DNN), such as a convolutional neural network (CNN) includingone of AlexNet, ResNet, GoogleNet, MobileNet, or the like. Alternatelyor additionally, an AI model may include any suitable recurrent neuralnetwork (RNN) or any variation thereof. Generally, an AI model or AIalgorithm employed by the AI engine 132 may also include any othersupervised learning, unsupervised learning, reinforcement learningalgorithm, or the like.

In various aspects, an AI model 134 of the AI engine 132 may beimplemented as a recurrent neural network with connections between nodesforming a cycle to retain information from a previous portion of aninput data sequence for a subsequent portion of the input data sequence(e.g., host I/Os or event descriptions). Alternately, an AI model may beimplemented as a feed-forward neural network having connections betweenthe nodes that do not form a cycle between input data sequences. In yetother cases, an AI model 134 of the AI engine 132 may include aconvolutional neural network (CNN) with multilayer perceptrons whereeach neuron in a given layer is connected with all neurons of anadjacent layer. In some aspects, the AI model 134 based on aconvolutional neural network may be applied to previous host systemactivity to predict or forecast some form of subsequent or future hostsystem behavior or activity. Alternately or additionally, the AI engine132 may include or utilize various regression models, such as multiplelinear regression models, a single linear regression model, logisticalregression models, step-wise regression models, multi-variate adaptiveregression models, locally estimated scatterplot models, or the like.

FIG. 8 illustrates at 800 example host I/O event types that are usefulto an AI model in predicting host system behavior in accordance with oneor more aspects. In this example, a Flash translation layer 324 is shownprocessing host I/Os 320 to generate corresponding media I/Os 326 foraccess to storage media 124. These host I/Os 320 may include orcorrespond to host I/O events 802-0 through 802-t for respective storagemedia access operations. Based on the host I/Os 320, the media accessmanager 130 provides inputs 332 to one or more of the AI models 134 ofthe AI engine 132. As shown in FIG. 8 , the inputs 332 may include eventdescriptors or descriptions that include an event type, an event size,an event duration, an event timestamp, or the like. Based on the inputs332 (e.g., host I/O events), the AI models 134 may generate or provideoutputs 334 to the media access manager 130 for predicted host behavior.As shown in FIG. 8 , the outputs 334 for predicted host behavior mayinclude a next time to idle, next idle duration, write density (ofsubsequent host system access), or the like.

In some aspects, the media access manager 130 or FTL 324 forwardsdescriptions or descriptors of these I/O events to the AI models 134 forprocessing with a neural network configured to enable prediction offuture host behavior or activity. In the context of the present example,the host I/O events 802 may be described or classified as shown inequation 1, where x_(t) is an event type and d_(t) is the duration ofthe event.

E _(t)=[x _(t) ,d _(t)]   Equation 1: Host I/O Event DescriptorStructure

Generally, an event type or I/O command may be described with referenceto a type of access (e.g., read/write) and size of the access (e.g., 4KB, 8 KB, or 16 KB). An event duration (d_(t)) may be defined as a timefrom arrival of the event or I/O command until the arrival of a next orsubsequent I/O command With respect to idle event and idle time, if aduration of time associated with an event is longer than a predefinedidle duration threshold (D), the event may be classified as “idle” andan associated duration time classified as “idle time”. As shown at 804,an input sequence for the AI model 134 may be formed from multiple eventdescriptors 806 where at any step (t) a history of host I/O events maybe defined as shown in equation 2.

E _(t−w+1) ,E _(t−w+2) , . . . E _(t−1) ,E _(t)   Equation 2: Host I/OEvent History

With an input sequence of host I/O events, a trained AI model 134 may beprovided with an idle duration threshold with which to predict a thetime to next idle (û_(t)), next idle duration ({circumflex over(v)}_(t)), and write intensity (ŝ_(t)). By way of example, consider FIG.9 which illustrates an example implementation of an AI model 900configured to predict host I/O behavior. In various aspects, an AI model134 may be based on recurrent neural network (RNN) architecture, whichmay prove advantageous in processing, with memory, a history of inputslike the I/O activity of a host system.

As shown in FIG. 9 , the AI model 900 includes an embedding layer 902,recurrent layer 904, and respective prediction layers and classificationlayers 906 through 910 for “time to next idle”, “next idle duration”,and “write amount until next idle”. In this example, input weights 912of the AI model include duration inputs 914 and event inputs 916, whichmay be generated and formatted by the media access manager 130 asdescribed herein. In some cases, embedding weights 918 are applied tothe event inputs 916 at the embedding layer 902 prior to the recurrentlayer 904. The recurrent layer 904 may also include hidden weights 920which may be applied to connect an instance of the AI model 900 with aprevious step or run of the AI model 900.

As shown in FIG. 9 , the AI model 900 also includes respective weights922 through 926 for predicting “time to next idle”, “next idleduration”, and “write amount until next idle”. Similarly, any or all ofthe prediction layers or classification layer 906 through 910 mayprovide corresponding predictions 928 through 932 of host I/O activityquantities of “time to next idle”, “next idle duration”, and “writeamount until next idle”. Generally, regression loss functions 934 and936, as well as a classification loss function 938, are used to trainthe model weights using back-propagation. Alternately or additionally,the AI model 900 may include an attention layer to enable the AI modelto learn which weights or layers to attend based on a history ofprevious hidden states.

As another example of predicted host behavior, consider FIG. 10 whichillustrates examples of predicted host behavior including various I/Osor idle time. In some aspects, a multi-stage approach may be implementedby an AI model 134 to predict a time to next idle, a next idle duration,and/or a write density of host I/O activity. As shown at 1000, the AImodel 134 takes a history of w events as an input sequence to predict anext event 1002 Ê_(t+1) to serve as an intermediate result. Here, assumethat the AI model 134 predicts a 4 KB read operation with 95%probability (shown as next event 1002) and an 8 KB write operation with5% probability. Based on the input sequence, the AI model 134 may alsopredict a duration {circumflex over (d)}_(t+1) for the next event 1002(e.g., for the 4 KB read operation).

This intermediate result, event 1002, may be provided back to the AImodel 134 at 1004 as a next event of the input sequence in order topredict a next event 1006 Ê_(t+2). Here, assume that the next event 1006may be predicted through probability to most likely be an 8 KB write ofduration {circumflex over (d)}_(t+2). This recursive approach of the AImodel 134 may be performed and/or repeated until an idle time event ispredicted. For example, the recursive approach is repeated as shown at1008 until a step t+T 1010 where the predicted duration {circumflex over(d)}_(t+T) is longer than a predefined idle duration threshold D, atwhich point an idle event is predicted, or step t+T reaches a predefinedlookahead step S. If the recursive approach results in prediction of anidle event, a subsequent prediction stage may be used to predict otherI/O-related quantities for host system activity, as shown in equations3-5 below.

predicted “time to next idle event”={circumflex over (d)} _(t+1) + . . .+{circumflex over (d)} _(t+T-1)  [3]

predicted “next idle duration”={circumflex over (d)} _(t+T)  [4]

predicted “write density”=Σ_(j)[Ê _(j)]_(data size)  [5]

-   -   where j=t+1, . . . t+T such that Ê_(j) is a write event

Equations 3-5: Quantities for predicted idle event-related host systembehavior

By way of example, consider FIG. 11 which illustrates an exampleimplementation of an AI model 1100 configured for multi-stage predictionof host I/O behavior. As shown in FIG. 11 , the AI model 1100 includesan embedding layer 1102, recurrent layer 1104, a prediction layer 1106,and classification layer 1108. In this example, input weights 1110 ofthe AI model 1100 include duration inputs 1112 and event inputs 1114,which may be generated and formatted by the media access manager 130 asdescribed herein. In some cases, embedding weights 1116 are applied tothe event inputs 1114 at the embedding layer 1102 prior to the recurrentlayer 1104. The recurrent layer 1104 may also include hidden weights1118 which may be applied to connect an instance of the AI model 1100with a previous step or run of the AI model 1100.

As shown in FIG. 11 , the AI model 1100 also includes “event duration”weights 1120 predicting various event durations and “event type” weights1122 for classifying events, which may be provided to the predictionlayer 1106. The prediction layer 1106 and classification layer 1108 mayprovide respective event predictions 1124 and event classifications 1126for a next or intermediate event. Generally, regression loss function1128 and classification loss function 1130 are used to train the modelweights using back-propagation. Alternately or additionally, the AImodel 1100 may include an attention layer to enable the AI model tolearn which weights or layers to attend based on a history of previoushidden states.

FIG. 12 illustrates at 1200 an example beam search useful to determine apath of predicted host behavior based on event probability. In someaspects, a beam search may be used to reduce or mitigate errorpropagation through predictions of host system behavior. Generally, ateach time step, the beam search implemented by the media access manager130 or AI engine 132 may keep K events with a highest joint or combinedprobability. By so doing, the beam search may increase a probabilitythat the AI engine 132 selects a best path of events from time step t+1to time step t+T. As shown in FIG. 12 , for K=3 events, the beam searchmay proceed along paths from event Ê_(t+1) 1202 to event Ê_(t+2) 1204based on assigned probabilities for each next event. Based on a chosenpath from the beam search to idle event Ê_(t+T) 1206, the AI engine 132may predict a “time to next idle event”, “next idle duration”, or “writedensity” using the AI models 134 as described herein.

FIG. 13 illustrates at 1300 an example timeline of operations for onlinere-training or online refinement of an AI model, which may enable the AImodel to adapt to a user- or device-specific I/O workload. Generally, anAI model used to predict host system I/O events or behavior may betrained offline using a large data set that is generic to a given typeof device or application before deployment of a storage system to auser. Once the device is used by the user, various applications andday-to-day usage patterns typically vary from one user to the next. Assuch, effectiveness or accuracy of an AI model may be improved withonline (e.g., run-time) refinement or re-training based on actual usageof the end user.

With respect to refining or retraining an AI model, the followingdiscussion may apply to either as these operations may be implementedwith same or similar operations. For example, an AI model may be refinedwhen the AI model is robust or already trained for a sufficient lengthof time on data meaningful to a corresponding user. In other cases, theAI model may be retrained if the AI model is not accurate with respectto best parameters for a specific user of the device. Refinement of theAI model may include a partial update of model parameters, such as anumber of layers in the model or modifying parameters in a given layerof a DNN. With respect to retraining, most or all of the AI model may beadjusted, such as with an introduction, extension, or removal of variousparameters, weights, or layers of a DNN. In various aspects, thefirmware of the storage controller 126, AI engine 132, or AI enginedriver 310 may retrain or refine one or more of the AI models 134.

In some aspects, an inference process and online training on an AI modelare performed concurrently or in parallel. The outputs provided by theinference process may be used for both predicting host system behaviorand for online training purposes. To do so, the firmware of the storagecontroller or AI engine 132 may synchronize or coordinate steps of theinference process and online training to ensure a correct or optimal setof model weights is used during the concurrent online training process.Generally online training (or refinement) includes a forward step and abackward step, which may be performed or repeated as necessary. Theforward step may be similar or same as an inference step. The backwardstep updates weights or parameters of the AI model being retrainedthrough back-propagation. In some cases, back-propagation includes acalculation of how much or to what degree one or more of the modelweights are modified. For example, the AI engine 132 may calculate aweight modifier (Δw) for each model weight and then add or subtract thatweight modifier from the corresponding weight.

As shown in FIG. 13 , when an AI model is trained or refined, two modelsor instances of the AI model may operate in parallel. In this example,an inference process 1302 runs concurrently or in parallel with onlinetraining 1304 of the AI model. Generally, one AI model may be designatedas a “current model” and another AI model designated as an “updatemodel”, which provides retrained or improved model weights. Theinstances of the AI model may be implemented concurrently or in parallelthrough the use of multiple buffers that store respective versions of AImodel parameters. With reference to FIG. 13 , these buffers are labeledas buffer A 1306 and buffer B 1308. As shown in FIG. 13 , the inferenceprocess 1302 may run continuously and the online training process 1304may also run continuously with the exception of when weights areupdated.

In some aspects, the inference process 1302 and online training 1304execute on the same hardware (not shown) to run inference steps andonline training processes. In other words, an inference part (or forwardpart) of the online training 1304 and the inference process 1302 may runon a same instance of the hardware illustrated in FIGS. 1-4 and/or othersystems (e.g., FIG. 19 ) or controllers (e.g., FIG. 20 ) as describedherein. For example, a single instance of inference hardware may beimplemented to perform both the inference process 1302 and onlinetraining 1304 of FIG. 13 . As shown in FIG. 13 , the inference process1302 may continue to run using weights of buffer B 1308 as weights ofBuffer A 1306 are updated. Similarly, the inference process 1302 maycontinue to run using the weights of buffer A 1306 as the weights ofbuffer B 1308 are updated.

In the context of the present example, online training 1304 may be runbased on a current AI model 134 in buffer A 1306. The AI engine 132 runsinference for one or multiple instances (steps) with the current AImodel 134 or model weights in buffer A 1306 as one batch 1310 (e.g.,batch 1 1310-1). On completion of a batch 1310, the AI model 134 isupdated through back-propagation 1312, which may be implemented by aprocessor core of the AI engine 132 or the storage controller 126.Updating the AI model 134 or model weights in buffer B 1308 consumestime, during which the inference process 1302 may continue to runthrough the other instance of the AI model 134. As the weight modifiers1314-1 of batch 1 1310-1 are updated in buffer B, the online trainingprocess 1304 may be locked (model parameter adjustment pauses) while theinference process 1302 continues to run on Buffer A. By so doing, theonline training process 1304 can be implemented without degradingperformance of the inference process 1302, while enabling quick andeffective training of the AI model.

After back-propagation is complete and the AI model 134 or model weightsin buffer B 1308 are updated, this buffer becomes the “current” bufferfor a next batch of online training 1304 and for the inference process1302. In other words, batch 2 1310-2 is run with the AI model 134 ormodel weights stored in buffer B 1308 and buffer A 1306 becomes the“update” buffer. As shown in FIG. 13 , in alternating fashion buffer1306 A holds the “current” model or model weights while buffer B 1308 isused as the “update” model or model weights. On completion of a modelupdate, the buffer then switch roles in that buffer 1308 B holds the“current” model or model weights while buffer A 1306 is used as the“update” model or model weights. In other words, the online training“ping-pongs” between the buffers with each batch (e.g., batch 3, batch4, and so on) to implement a continuous online training process. Duringthe online or runtime training, the inference process 1302 may use themodel or model weights of the “current” buffer to provide adaptivepredictions of host system behavior. In other aspects, AI modelrefinement of retraining may be performed such that AI model training isimplemented during a first set of time intervals and inference isperformed in a second set of different time intervals.

Techniques for AI-Enabled Management of Storage Media Access

The following discussion describes techniques for AI-enabled managementof storage media access, which may schedule host I/Os or internal I/Osto optimize performance of a storage drive. These techniques may beimplemented using any of the environments and entities described herein,such as the media access manager 130, AI engine 132, and/or AI models134. These techniques include various methods illustrated in FIGS. 14-18, each of which is shown as a set of operations that may be performed byone or more entities.

These methods are not necessarily limited to the orders of operationsshown in the associated figures. Rather, any of the operations may berepeated, skipped, substituted, or re-ordered to implement variousaspects described herein. Further, these methods may be used inconjunction with one another, in whole or in part, whether performed bythe same entity, separate entities, or any combination thereof. Forexample, the methods may be combined to implement AI-enabled managementof storage media to schedule host I/Os and/or internal I/Os of a storagemedia system based on a prediction of host system behavior to optimizehost system performance with respect to storage media access (e.g.,latency or throughput). In portions of the following discussion,reference will be made to the operating environment 100 of FIG. 1 andvarious entities or configurations of FIGS. 2-13 by way of example. Suchreference is not to be taken as limiting described aspects to theoperating environment 100, entities, or configurations, but rather asillustrative of one of a variety of examples. Alternately oradditionally, operations of the methods may also be implemented by orwith entities described with reference to the System-on-Chip of FIG. 19and/or the storage system controller of FIG. 20 .

FIG. 14 depicts an example method 1400 for AI-enabled management ofstorage media access, including operations performed by or with themedia access manager 130 or AI engine 132 of a media storage controller.

At 1402, host I/Os for access to storage media are received from a hostsystem. The host I/Os may be received from a host system via a hostsystem interface of a storage system or storage controller. In somecases, the host I/Os include or correspond to write or read commands ofthe host system that include a respective logical block address, size,or timestamp.

At 1404, information describing the host I/Os of the host system areprovided to an AI engine. The AI engine may be associated with thestorage system, as part of a storage controller or accessible by thestorage controller of the storage system. In some cases, the informationdescribing or descriptors of the host I/Os includes an event type of thehost I/O, an event duration of the host I/O, or an event size of dataassociated with the host I/O.

At 1406, a prediction of host system behavior is received from the AIengine. The prediction of host system behavior may include informationdescribing host system activity with respect to subsequent access of thestorage media by the host system. In some cases, the prediction of hostsystem behavior includes an indication of a duration of time until thehost system becomes idle, a duration of time for which the host systemwill remain idle, or parameters regarding a next host I/O issued by thehost system.

At 1408, the host I/Os for access to the storage media are scheduledbased on the prediction of host system behavior. The host I/Os may bescheduled to preempt other I/Os of internal tasks of the storage system.In some cases, the host I/Os are scheduled or performed with modifiedparameters or thresholds for garbage collection, caching, datamigration, or thermal throttling. For example, the media access managermay suspend or preempt one or more thresholds to prevent internalstorage system operations from triggering to enable completion of thehost I/Os.

Optionally at 1410, internal I/Os of the storage system are scheduledbased on the prediction of host system behavior. In some cases, themedia access manager determines that the storage system has internalI/Os that are pending. The scheduling of the internal I/Os may includeadvancing or delaying the internal I/Os of the storage system based onthe prediction of host system behavior, such as to mitigate contentionbetween the internal I/Os and the host I/Os or subsequent host I/Os foraccess to the storage media.

At 1412, media I/Os that correspond to the scheduled host I/Os and/orinternal I/Os of the storage system are performed. The FTL of thestorage controller may generate respective media I/Os that correspond tothe scheduled host I/Os and the internal I/Os of the storage system. Inresponse to the media I/Os, the storage system may return read data tothe host system or store data of the host system to the storage system.

FIG. 15 depicts an example method 1500 for delaying an internaloperation of a storage system based on a prediction of host systembehavior. In some aspects, the media access manager 130 or the AI engine132 of a media storage controller perform the operations of the method1500.

At 1502, information describing respective types and durations of hostI/O events are provided to an AI model of an AI engine. The host I/Oevents may specify or request access to storage media of a storagesystem with which the AI engine is associated. In some cases, theinformation describing (or descriptors) of the host I/Os events includesan event type of the host I/O, an event duration of the host I/O, or anevent size of data associated with the host I/O.

At 1504, a prediction of upcoming idle time of the host system isreceived from the AI model of the AI engine. The prediction of theupcoming idle time may include an indication of a duration of time untilthe host system becomes idle, a duration of time for which the hostsystem will remain idle, or parameters regarding a next host I/O issuedby the host system. In some cases, the prediction includes an indicationof expected host I/O activity that is likely to occur before the nextidle time of the host system.

At 1506, an internal operation or internal task of the storage system isdelayed based on the prediction of upcoming idle time of the hostsystem. The internal operation may include garbage collection, datamigration, or other tasks that involve access of the storage media. Insome cases, the internal operations are triggered by respectivethresholds, which may be suspended or preempted while the internaloperations are delayed.

At 1508, media I/Os that correspond to host I/Os of the host system areperformed while the internal operation of the storage system is delayed.The media I/Os performed may correspond to write commands or readcommands of the host system. While the internal operation of the storagesystem is delayed, the media I/Os may be performed without competitionfrom or contention with internal I/Os of the storage system. As such,the host I/O performance may be optimized be permitting the host systemfull access to the storage media (e.g., without throttling or stalling).

At 1510, other media I/Os that correspond to internal I/Os of thedelayed internal operation are performed during the upcoming idle time.The internal I/Os may correspond to delayed data migration or delayedgarbage collection that is performed during the idle time of the hostsystem. In some cases, the delay is effective to prevent the datamigration or garbage collection from degrading performance of host I/Os,such that host system access to the storage media is optimized.

FIG. 16 depicts an example method 1600 for advancing an internaloperation of a storage system based on a prediction of host systembehavior. In some aspects, the media access manager 130 or the AI engine132 of a media storage controller perform the operations of the method1600.

At 1602, information describing host I/Os of a host system are providedto an AI model of an AI engine. The host I/Os may include I/O commandsor I/O operations for access to storage media of a storage system withwhich the AI engine is associated. In some cases, the informationdescribing (or descriptors) of the host I/Os events includes an eventtype of the host I/O, an event duration of the host I/O, or an eventsize of data associated with the host I/O.

At 1604, a prediction of access activity of the host system is receivedfrom the AI model of the AI engine. The prediction of access activitymay include a write density of the host system, an amount of data to bewritten to the storage media, or an expected duration of the writeactivity. In some cases, the prediction also includes an amount of timeuntil the write activity occurs, such that the media access manager maydetermine whether other internal tasks may be performed before thepredicted activity of the host system begins.

At 1606, an internal operation of the storage system is advanced basedon the prediction of access activity of the host system. The internaloperation may include garbage collection or data migration, such as tofree space of a cache memory or increase the amount of free space beyonda nominal threshold for free space in the cache memory or storage media.For example, the internal operation may be advanced and configured tofree up enough cache space to prevent the predicted activity of the hostsystem from triggering internal operations while the activity occurs.

At 1608, media I/Os that correspond to internal I/Os of the advancedinternal operation are performed. The media I/Os may correspond toadvanced or anticipated garbage collection or data migration for a cachememory or storage media. In at least some cases, the advanced orpredictive internal operations create enough free space in a cachememory or storage media to enable the host system activity without theactivity triggering thresholds for additional internal operations.

At 1610, other media I/O that correspond to host I/Os of the predictedaccess activity of the host system are performed. After at least some ofthe media I/Os of the internal operations are performed, the media I/Osfor the host I/Os are performed for storage media access. In some cases,the free space provided by the advanced internal operations enable themedia I/Os that correspond to the host I/Os to be performed at a fulllevel of host system access. As such, by advancing the internaloperations of the storage system, host system access to the storagemedia may be optimized by preventing internal I/Os from competing withthe host I/Os for access to the storage media.

FIG. 17 depicts an example method 1700 for altering a parameter fordevice-level management of storage media based on a prediction of hostsystem behavior. In some aspects, the media access manager 130 or the AIengine 132 of a media storage controller perform the operations of themethod 1700.

At 1702, host I/Os for access to storage media are received from a hostsystem. The host I/Os may be received from a host system via a hostsystem interface of a storage system or storage controller. In somecases, the host I/Os include or correspond to write or read commands ofthe host system that include a respective logical block address, size,or timestamp.

At 1704, information describing the host I/Os are provided to anartificial intelligence engine. The AI engine may be associated with thestorage system, as part of a storage controller or accessible by thestorage controller of the storage system. In some cases, the informationdescribing or descriptors of the host I/Os includes an event type of thehost I/O, an event duration of the host I/O, or an event size of dataassociated with the host I/O.

At 1706, a prediction of host behavior is received from the artificialintelligence engine. The prediction of host system behavior may includeinformation describing host system activity with respect to subsequentaccess of the storage media by the host system. In some cases, theprediction of host system behavior includes an indication of a durationof time until the host system becomes idle, a duration of time for whichthe host system will remain idle, or parameters regarding a next hostI/O issued by the host system (e.g., write intensity).

At 1708, a parameter for device-level management of the storage media isaltered based on the prediction of host system behavior. The parametermay include a threshold for thermal management of storage media devices.For example, based on an upcoming idle time, the media access managermay preempt or suspend a thermal limit to allow host I/Os for a datawrite to complete at full performance without thermal throttling accessto the storage media. Alternately or additionally, a threshold forgarbage collection or data migration may be preempted or suspended toallow full host I/O performance for an access operation.

Based on a prediction of upcoming idle time (no host system activity),the host system may be provided will full performance access knowingthat the storage media devices will have sufficient time to cool aftercompletion of the operation. In other cases, the media access managermay determine that a write operation will likely complete withoutrunning out of free space even if garbage collection or data migrationis delayed until after the write operation (e.g., altering or suspendingthe thresholds for either).

At 1710, media I/Os that correspond to host I/Os of the host system areperformed in accordance with the altered parameter for device-levelmanagement. The FTL of the storage controller may generate respectivemedia I/Os that correspond to the scheduled host I/Os and the internalI/Os of the storage system. The media I/Os may be performed inaccordance with an altered or suspended thermal limit for the storagemedia or storage media devices. Alternately or additionally, the mediaI/Os may be performed in accordance with an altered or suspendedthreshold for data migration or garbage collection.

FIG. 18 depicts an example method 1800 for running inferences withmultiple instances of an AI model to enable online retraining orrefinement, including operations performed by or with the media accessmanager 130 of the storage system.

At 1802, inferences are run, via hardware of an AI engine, with firstparameters of a first instance (e.g., respective weights or parameters)of an AI model. The first parameters may be stored to a first buffer ormemory area. In some cases, the first instance of the AI model is a“current” instance of the AI model used to run inferences. Theinferences may be run in parallel with an online training process thatruns on the same hardware of the AI engine.

At 1804, second parameters of a second instance of the AI model areupdated based on the first parameters of the first instance of the AImodel. The second parameters may be stored to a second buffer or memoryarea. In some cases, the second instance of the AI model is an “update”instance of the AI model that is updated from the “current” instancerunning on the hardware. During the update of the second parameters ofthe second instance of the AI model, the online retraining process maybe paused. At 1806, while the second parameters of the second instanceare updated, the inferences continue to run with the first parameters ofthe first instance of the AI model.

At 1808, inferences are run, via the hardware of the AI engine, with thesecond parameters of the second instance of the AI model. The secondinstance of the AI model may become the “current” instance of the AImodel used to run the inferences. The first instance of the AI model maybecome the “update” instance of the AI model that is updated from the“current” instance running on the hardware. The inferences may be run inparallel with an online training process that runs on the same hardwareof the AI engine.

At 1810, the first parameters of the first instance of the AI model areupdated based on the second parameters of the second instance of the AImodel. During the update of the first parameters of the first instanceof the AI model, the online retraining process may be paused. At 1812,while the first parameters of the first instance are updated, theinferences continue to run with the second parameters of the secondinstance of the AI model. From operation 1812, the method 1800 mayreturn to operation 1802 to start another iteration of the onlinetraining process.

System-on-Chip

FIG. 19 illustrates an exemplary System-on-Chip (SoC) 1900 that mayimplement various aspects of AI-enabled management of storage mediaaccess. The SoC 1900 may be implemented in any suitable system ordevice, such as a smart-phone, netbook, tablet computer, access point,network-attached storage, camera, smart appliance, printer, set-top box,server, data center, solid-state drive (SSD), hard disk drive (HDD),storage drive array, memory module, automotive computing system, or anyother suitable type of device (e.g., others described herein). Althoughdescribed with reference to a SoC, the entities of FIG. 19 may also beimplemented as other types of integrated circuits or embedded systems,such as an application-specific integrated-circuit (ASIC), memorycontroller, storage controller, communication controller,application-specific standard product (ASSP), digital signal processor(DSP), programmable SoC (PSoC), system-in-package (SiP), orfield-programmable gate array (FPGA).

The SoC 1900 may be integrated with electronic circuitry, amicroprocessor, memory, input-output (I/O) control logic, communicationinterfaces, firmware, and/or software useful to provide functionalitiesof a computing device, host system, or storage system, such as any ofthe devices or components described herein (e.g., storage drive orstorage array). The SoC 1900 may also include an integrated data bus orinterconnect fabric (not shown) that couples the various components ofthe SoC for control signaling, data communication, and/or routingbetween the components. The integrated data bus, interconnect fabric, orother components of the SoC 1900 may be exposed or accessed through anexternal port, parallel data interface, serial data interface,fabric-based interface, peripheral component interface, or any othersuitable data interface. For example, the components of the SoC 1900 mayaccess or control external storage media, AI engines, AI models, or AInetworks through an external interface or off-chip data interface.

In this example, the SoC 1900 includes various components such asinput-output (I/O) control logic 1902 and a hardware-based processor1904 (processor 1904), such as a microprocessor, processor core,application processor, DSP, or the like. The SoC 1900 also includesmemory 1906, which may include any type and/or combination of RAM, SRAM,DRAM, non-volatile memory, ROM, one-time programmable (OTP) memory,multiple-time programmable (MTP) memory, Flash memory, and/or othersuitable electronic data storage. In some aspects, the processor 1904and code stored on the memory 1906 are implemented as a storage systemcontroller or storage aggregator to provide various functionalitiesassociated with AI-enabled management of storage media access. In thecontext of this disclosure, the memory 1906 stores data, code,instructions, or other information via non-transitory signals, and doesnot include carrier waves or transitory signals. Alternately oradditionally, SoC 1900 may comprise a data interface (not shown) foraccessing additional or expandable off-chip storage media, such assolid-state memory (e.g., Flash or NAND memory), magnetic-based memorymedia, or optical-based memory media.

The SoC 1900 may also include firmware 1908, applications, programs,software, and/or operating system, which may be embodied asprocessor-executable instructions maintained on the memory 1906 forexecution by the processor 1904 to implement functionalities of the SoC1900. The SoC 1900 may also include other communication interfaces, suchas a transceiver interface for controlling or communicating withcomponents of a local on-chip (not shown) or off-chip communicationtransceiver. Alternately or additionally, the transceiver interface mayalso include or implement a signal interface to communicate radiofrequency (RF), intermediate frequency (IF), or baseband frequencysignals off-chip to facilitate wired or wireless communication throughtransceivers, physical layer transceivers (PHYs), or media accesscontrollers (MACs) coupled to the SoC 1900. For example, the SoC 1900may include a transceiver interface configured to enable storage over awired or wireless network, such as to provide a network attached storage(NAS) volume with AI-enabled management of storage media access.

The SoC 1900 also includes a media access manager 130, AI engine 132,and AI models 134, which may be implemented separately as shown orcombined with a storage component, data interface, or accessible throughan off-chip interface (e.g., AI models stored to external memory). Inaccordance with various aspects of AI-enabled management of storagemedia access, the media access manager 130 may interact with the AIengine 132 to generate a prediction of host behavior with respect tofuture storage media access and schedule, based on the prediction,internal operations (e.g., internal task I/Os) to optimize storage mediaperformance. Alternately or additionally, the AI engine 132 mayimplement multiple AI models 134 to predict host behavior, which may beexecuted concurrently and/or with online refinement or retraining forimproved prediction of host-specific behavior. Any of these entities maybe embodied as disparate or combined components, as described withreference to various aspects presented herein. Examples of thesecomponents and/or entities, or corresponding functionality, aredescribed with reference to the respective components or entities of theenvironment 100 of FIG. 1 or respective configurations illustrated inFIG. 2 through FIG. 13 . The media access manager 130 or AI engine 132,either in whole or part, may be implemented as processor-executableinstructions maintained by the memory 1906 and executed by the processor1904 to implement various aspects and/or features of AI-enabledmanagement of storage media access.

The media access manager 130, may be implemented independently or incombination with any suitable component or circuitry to implementaspects described herein. For example, the media access manager 130 orAI engine 132 may be implemented as part of a DSP, processor/storagebridge, I/O bridge, graphics processing unit, memory controller, storagecontroller, arithmetic logic unit (ALU), or the like. The media accessmanager 130 may also be provided integral with other entities of SoC1900, such as integrated with the processor 1904, memory 1906, a storagemedia interface, or firmware 1908 of the SoC 1900. Alternately oradditionally, the media access manager 130, AI engine 132, and/or othercomponents of the SoC 1900 may be implemented as hardware, firmware,fixed logic circuitry, or any combination thereof.

As another example, consider FIG. 20 which illustrates an examplestorage system controller 2000 in accordance with one or more aspects ofAI-enabled management of storage media access. In various aspects, thestorage system controller 2000 or any combination of components thereofmay be implemented as a storage drive controller, storage mediacontroller, NAS controller, Fabric interface, NVMe target, or storageaggregation controller for solid-state storage media. In some cases, thestorage system controller 2000 is implemented similar to or withcomponents of the SoC 1900 as described with reference to FIG. 19 . Inother words, an instance of the SoC 1900 may be configured as a storagesystem controller, such as the storage system controller 2000 to managesolid-state (e.g., NAND Flash-based) media with AI assistance.

In this example, the storage system controller 2000 includesinput-output (I/O) control logic 2002 and a processor 2004, such as amicroprocessor, processor core, application processor, DSP, or the like.In some aspects, the processor 2004 and firmware of the storage systemcontroller 2000 may be implemented to provide various functionalitiesassociated with AI-enabled management of storage media access, such asthose described with reference to any of methods 1400 through 1800. Thestorage system controller 2000 also includes a host interface 2006(e.g., SATA, PCIe, NVMe, or Fabric interface) and a storage mediainterface 2008 (e.g., NAND interface), which enable access to a hostsystem and storage media, respectively. The storage system controller2000 also includes a Flash translation layer 2010 (FTL 2010),device-level manager 2012, and I/O scheduler 322. In some aspects ofAI-enabled management of storage media access, the FTL 2010 and/ordevice-level manager 2012 generate internal I/Os of the storage systemcontroller or manage storage media devices through the storage mediainterface 2008.

The storage system controller 2000 also includes instances of a mediaaccess manager 130, AI engine 132, and AI models 134. Any or all ofthese components may be implemented separately as shown or combined withthe processor 2004, host interface 2006, storage media interface 2008,Flash translation layer 2010, device-level manager 2012, and/or I/Oscheduler 322. Examples of these components and/or entities, orcorresponding functionality, are described with reference to therespective components or entities of the environment 100 of FIG. 1 orrespective configurations illustrated in FIG. 2 through FIG. 13 . Inaccordance with various aspects of AI-enabled management of storagemedia access, the media access manager 130 may interact with the AIengine 132 of the storage system controller 2000 to generate aprediction of host behavior with respect to future storage media accessand schedule, based on the prediction, internal operations (e.g.,internal task I/Os) to optimize storage media performance. Alternatelyor additionally, the AI engine 132 may implement multiple AI models 134to predict host behavior, which may be executed concurrently and/or withonline refinement or retraining for improved prediction of host-specificbehavior. The media access manager 130, either in whole or part, may beimplemented as processor-executable instructions maintained by memory ofthe controller and executed by the processor 2004 to implement variousaspects and/or features of AI-enabled management of storage mediaaccess.

Although the subject matter has been described in language specific tostructural features and/or methodological operations, it is to beunderstood that the subject matter defined in the appended claims is notnecessarily limited to the specific examples, features, or operationsdescribed herein, including orders in which they are performed.

In the following, some examples are described:

Example 1: A method for artificial intelligence-enabled management ofstorage media access, comprising:

receiving, from a host system and via a host interface of a storagesystem, host input/outputs (I/Os) for access to storage media of thestorage system;

providing, to an artificial intelligence engine associated with thestorage system, information describing the host I/Os received from thehost system;

receiving, from the artificial intelligence engine, a prediction of hostsystem behavior with respect to subsequent access of the storage mediaby the host system; and

scheduling, based on the prediction of host system behavior, the hostI/Os for access to the storage media of the storage system.

Example 2: The method of claim 1, further comprising:

determining that the storage system has internal I/Os that are pending,and wherein the scheduling comprises:

scheduling, based on the prediction of host system behavior, the hostI/Os of the host system and the internal I/Os of the storage system foraccess to the storage media of the storage system.

Example 3: The method of claim 2, wherein scheduling the internal I/Osof the storage system comprises advancing or delaying the internal I/Osof the storage system based on the prediction of host system behavior tomitigate contention between the internal I/Os and the host I/Os orsubsequent host I/Os for access to the storage media.

Example 4: The method of claim 2, wherein the internal I/Os correspondto one or more tasks of a Flash translation layer of the storage systemthat includes one of garbage collection, data migration, or wearleveling.

Example 5: The method of claim 1, wherein the prediction of host systembehavior received from the artificial intelligence engine comprises anindication of:

a duration of time until the host system becomes idle;

a duration of time for which the host system will remain idle; or

parameters regarding a next host I/O issued by the host system.

Example 6: The method of claim 1, wherein the information describing thehost I/Os comprises, for at least one of the host I/Os, an indicationof:

an event type of the host I/O;

an event duration of the host I/O; or

an event size of data associated with the host I/O.

Example 7: The method of claim 1, wherein the scheduling of the hostI/Os is based on the prediction of host system behavior received fromthe artificial intelligence engine and device-level parameters of thestorage media for thermal management of the storage media.

Example 8: The method of claim 1, wherein:

the artificial intelligence engine executes multiple artificialintelligence models;

at least two of the multiple artificial intelligence models areassociated with respective internal tasks implemented by a Flashtranslation layer or device-level manager of the storage system; and

the method further comprises loading, prior to providing the informationto the artificial intelligence engine, at least one of the multipleintelligence models to the artificial intelligence engine to enable theprediction of the host system behavior.

Example 9: The method of claim 8, further comprising executing, via theartificial intelligence engine, the at least two of the multipleartificial intelligence models concurrently to implement at least twoartificial intelligence-assisted internal tasks of the storage system.

Example 10: The method of claim 8, further comprising executing, via theartificial intelligence engine, two instances of one of the multipleartificial intelligence models in parallel to enable online re-trainingor refinement of the artificial intelligence model.

Example 11: An apparatus comprising:

a host interface configured for communication with a host system;

storage media to store data of the host system;

a media interface configured to enable access to the storage media;

an artificial intelligence engine; and

a media access manager configured to:

receive, via the host interface, host input/outputs (I/Os) from the hostsystem for access to the storage media of the apparatus;

provide, to the artificial intelligence engine, information describingthe host I/Os received from the host system;

receive, from the artificial intelligence engine, a prediction of hostsystem behavior with respect to subsequent access of the storage mediaby the host system; and

schedule, based on at least the prediction of host system behavior, thehost I/Os for access to the storage media of the apparatus.

Example 12: The apparatus of claim 11, wherein the media access manageris further configured to:

determine that the apparatus has internal I/Os for access to the storagemedia that are pending; and

schedule, based on the prediction of host system behavior, the host I/Osof the host system and the internal I/Os of the apparatus for access tothe storage media of the apparatus.

Example 13: The apparatus of claim 11, wherein:

the prediction of host system behavior received from the artificialintelligence engine comprises an indication of at least one of aduration of time until the host system becomes idle, a duration of timefor which the host system will remain idle, or parameters regarding anext host I/O issued by the host system; or

the information describing the host I/Os comprises, for at least one ofthe host I/Os, an indication of an event type of the host I/O, an eventduration of the host I/O, or an event size of data associated with thehost I/O.

Example 14: The apparatus of claim 11, wherein:

the artificial intelligence engine executes multiple artificialintelligence models that include at least two of the multiple artificialintelligence models are associated with respective internal tasksimplemented by a Flash translation layer or device-level manager of theapparatus; and

the media access manager is further configured to load, prior toproviding the information to the artificial intelligence engine, atleast one of the multiple intelligence models to the artificialintelligence engine to enable the prediction of the host systembehavior.

Example 15: The apparatus of claim 14, wherein the media access manageris further configured to:

cause the artificial intelligence engine to execute the at least two ofthe multiple artificial intelligence models concurrently to implement atleast two artificial intelligence-assisted internal tasks of theapparatus; or

cause the artificial intelligence engine to execute two instances of oneof the multiple artificial intelligence models in parallel to enableonline re-training or refinement of the artificial intelligence model.

Example 16: A System-on-Chip (SoC) comprising:

a media interface to access storage media of a storage system;

a host interface to communicate with a host system;

an artificial intelligence engine;

a hardware-based processor;

a memory storing processor-executable instructions that, responsive toexecution by the hardware-based processor, implement a media accessmanager to:

receive, via the host interface, host input/outputs (I/Os) from the hostsystem for access to the storage media of the storage system;

provide, to the artificial intelligence engine, information describingthe host I/Os received from the host system;

receive, from the artificial intelligence engine, a prediction of hostsystem behavior with respect to subsequent access of the storage mediaby the host system; and

schedule, based on at least the prediction of host system behavior, thehost I/Os for access to the storage media of the storage system.

Example 17: The SoC of claim 16, wherein the media access manager isfurther configured to determine that the SoC has internal I/Os foraccess to the storage media that are pending; and

schedule, based on the prediction of host system behavior, the host I/Osof the host system and the internal I/Os of the SoC for access to thestorage media of the SoC, the scheduling of the internal I/Os comprisingadvancing or delaying the internal I/Os of the SoC to mitigatecontention between the internal I/Os and the host I/Os.

Example 18: The SoC of claim 16, wherein:

the prediction of host system behavior received from the artificialintelligence engine comprises an indication of at least one of aduration of time until the host system becomes idle, a duration of timefor which the host system will remain idle, or parameters regarding anext host I/O issued by the host system; or

the information describing the host I/Os comprises, for at least one ofthe host I/Os, an indication of an event type of the host I/O, an eventduration of the host I/O, or an event size of data associated with thehost I/O.

Example 19: The SoC of claim 16, the media access manager is furtherconfigured to schedule the host I/Os based on the prediction of hostsystem behavior and device-level parameters of the storage media forthermal management of the storage media.

Example 20: The SoC of claim 16, wherein:

the artificial intelligence engine of the SoC executes multipleartificial intelligence models that include at least two of the multipleartificial intelligence models associated with respective internal tasksimplemented by a Flash translation layer or device-level manager of theSoC; and

the media access manager is further configured to:

cause the artificial intelligence engine to execute the at least two ofthe multiple artificial intelligence models concurrently to implement atleast two artificial intelligence-assisted internal tasks of the SoC; or

cause the artificial intelligence engine to execute two instances of oneof the multiple artificial intelligence models in parallel to enableonline re-training or refinement of the artificial intelligence model.

What is claimed is:
 1. A method for artificial intelligence-enabledmanagement of storage media access, comprising: receiving, at a storagemedia system, multiple host input/outputs (I/Os) from a host system foraccess to storage media of the storage media system; providing, to anartificial intelligence engine associated with the storage media system,respective information for individual host I/Os of the multiple hostI/Os that indicates an access-type of a host I/O and an amount of dataassociated with the host I/O; receiving, from the artificialintelligence engine and based on the respective information for theindividual host I/Os, a prediction of future host system I/O activitywith respect to subsequent access of the storage media system by thehost system; scheduling first media I/Os for access to the storage mediaof the storage system that correspond to the multiple host I/Os; andscheduling, based on the prediction of future host system I/O activity,second media I/Os for storage media access that correspond to internalI/Os of the storage media system for access to the storage media.
 2. Themethod of claim 1, wherein the scheduling comprises scheduling thesecond media I/Os that correspond to the internal I/Os of the storagemedia system to avoid contention between the internal I/Os and at leastsome of the first media I/Os that correspond to the multiple host I/Os.3. The method of claim 1, wherein the scheduling comprises schedulingthe second media I/Os that correspond to the internal I/Os of thestorage media system to avoid contention between the internal I/Os andat least a third media I/O that corresponds to the next host I/O issuedby the host system.
 4. The method of claim 1, wherein the respectiveinformation indicating the access-type of the host I/O comprises, for atleast one of the multiple host I/Os, an indication of: an event type ofthe host I/O; an event duration of the host I/O; or an event size ofdata associated with the host I/O.
 5. The method of claim 1, wherein theprediction of future host system I/O activity received from theartificial intelligence engine comprises an indication of at least oneof: a duration of time until the at least one of the multiple host I/Osis completed; a duration of time until the host system becomes idle; aduration of time for which the host system will remain idle; orparameters regarding the next host I/O issued by the host system.
 6. Themethod of claim 1, wherein the storage media system comprises one of astorage device, a storage drive, a storage array, a storage volume, anetwork-attached storage (NAS) device, a data center, a server, avirtualized storage system, cloud-based storage, a solid-state storagedrive SSD, a non-volatile memory express (NVMe) SSD, or a peripheralcomponent interconnect express (PCIe) SSD.
 7. The method of claim 1,wherein the method is implemented by a storage controller of the storagemedia system, the storage media controller comprising a first interfaceconfigured to communicate with the host system and a second interfaceconfigured to access the storage media of the storage media system. 8.An apparatus comprising: a host interface configured for communicationwith a host system; a media interface configured to enable access tostorage media operably coupled to the apparatus; an artificialintelligence engine; and a storage media controller configured to:receive, via the host interface, host input/outputs (I/Os) from the hostsystem for access to the storage media; provide, to the artificialintelligence engine, information describing the host I/Os received fromthe host system; receive, from the artificial intelligence engine, aprediction of future host system I/O activity with respect to subsequentaccess of the storage media by the host system; determine that theapparatus has internal I/Os for access to the storage media that arepending; schedule first media I/Os for storage media access thatcorrespond to the host I/Os for access to the storage media; andschedule, based on the prediction of future host system I/O activity,second media I/Os for storage media access that correspond to theinternal I/Os for access to the storage media.
 9. The apparatus of claim8, wherein to schedule the second media I/Os, the storage mediacontroller is further configured to: schedule the second media I/Os thatcorrespond to the internal I/Os to avoid contention between the internalI/Os and at least some of the first media I/Os that correspond to thehost I/Os; or schedule the second media I/Os that correspond to theinternal I/Os to avoid contention between the internal I/Os and at leasta third media I/O that corresponds to the next host I/O issued by thehost system.
 10. The apparatus of claim 8, wherein the informationdescribing the host I/Os comprises, for at least one of the host I/Os,an indication of: an event type of the host I/O; an event duration ofthe host I/O; or an event size of data associated with the host I/O. 11.The apparatus of claim 8, wherein the scheduling of the second mediaI/Os that correspond to the internal I/Os is based on: the prediction offuture host system I/O activity received from the artificialintelligence engine; and device-level parameters of the storage mediafor thermal management of the storage media.
 12. The apparatus of claim8, wherein the artificial intelligence engine executes multipleartificial intelligence models that include at least two of the multipleartificial intelligence models are associated with respective internaltasks implemented by a Flash translation layer or a device-level managerof the storage media; and the storage media controller is furtherconfigured to load, prior to providing the information to the artificialintelligence engine, at least one of the multiple intelligence models tothe artificial intelligence engine to enable the prediction of thefuture host system I/O activity.
 13. The apparatus of claim 8, furthercomprising at least a portion of the storage media and where in theapparatus is configured as a storage media system.
 14. The apparatus ofclaim 13, wherein the storage media system comprises one of a storagedevice, a storage drive, a storage array, a storage volume, anetwork-attached storage (NAS) device, a data center, a server, avirtualized storage system, cloud-based storage, a solid-state storagedrive SSD, a non-volatile memory express (NVMe) SSD, or a peripheralcomponent interconnect express (PCIe) SSD.
 15. An apparatus comprising:a media interface configured to access storage media of a storagesystem; a host interface configured to communicate with a host system;an artificial intelligence engine; a hardware-based processor; a memorystoring processor-executable instructions that, responsive to executionby the hardware-based processor, implement an artificialintelligence-enabled (AI-enabled) storage controller to: receive, viathe host interface, host input/outputs (I/Os) from the host system foraccess to the storage media of the storage system; provide, to theartificial intelligence engine, information describing the host I/Osreceived from the host system; receive, from the artificial intelligenceengine, a prediction of future host system I/O activity with respect tosubsequent access of the storage media by the host system; schedulefirst media I/Os for storage media access that correspond to the hostI/Os for access to the storage media; and schedule, based on theprediction of future host system I/O activity, second media I/Os forstorage media access that correspond to internal I/Os of the apparatusfor access to the storage media.
 16. The apparatus of claim 15, whereinto schedule the second media I/Os, the AI-enabled storage mediacontroller is further configured to: schedule the second media I/Os thatcorrespond to the internal I/Os to avoid contention between the internalI/Os and at least some of the first media I/Os that correspond to thehost I/Os; or schedule the second media I/Os that correspond to theinternal I/Os to avoid contention between the internal I/Os and at leasta third media I/O that corresponds to the next host I/O issued by thehost system.
 17. The apparatus of claim 15, wherein the AI-enabledstorage controller is further implemented to provide, to the artificialintelligence as part of the respective information for the individualhost I/Os, at least one of: respective information that describes alogical block address for access requested by the host I/O; respectiveinformation that describes a duration of time associated processing hostI/O; or respective information that describes a time at which the hostI/O is received from the host system.
 18. The apparatus of claim 15,wherein the artificial intelligence engine of the apparatus isconfigured to define as part of the prediction of future host systemidle time one of: a duration of time that is predicted to lapse beforecompletion of the at least one of the multiple host I/Os; a duration oftime that is predicted to lapse before the host system enters an idlestate after completion of the multiple host I/Os; a duration of time forwhich the host system is predicted to remain idle after completion ofthe multiple host I/Os; or predicted parameters for the next host I/Oissued by the host system.
 19. The apparatus of claim 15, furthercomprising: the storage media that is coupled to the media interface ofthe apparatus; and wherein the apparatus is embodied as part of thestorage media system or a storage media drive of the storage mediasystem.
 20. The apparatus of claim 19, wherein the storage media systemcomprises one of a storage device, a storage drive, a storage array, astorage volume, a network-attached storage (NAS) device, a data center,a server, a virtualized storage system, cloud-based storage, asolid-state storage drive SSD, a non-volatile memory express (NVMe) SSD,or a peripheral component interconnect express (PCIe) SSD.